LZAC Lossless Data Compression
نویسنده
چکیده
This paper presents LZAC, a new universal lossless data compression algorithm derived from the popular and widely used LZ77 family. The objective of LZAC is to improve the compression ratios of the LZ77 family while still retaining the family’s key characteristics: simple, universal, fast in decoding, and economical in memory consumption. LZAC presents two new ideas: composite fixed-variable-length coding and offsetdifference coding. Composite fixed-variable-length coding combines fixed-length coding and variable-length coding into a single coding scheme. A pre-calculated threshold determines when to use one coding scheme over the other. Offset-difference coding improves the coding efficiency by coding the difference between the offset and the match length instead of the offset when variable-length coding in a non-lookahead case is used. When variable-length coding is used in a lookahead case, offset-difference coding improves the coding efficiency by coding the difference between the match length and the offset instead of the match length. The literal case codeword of LZAC, <0, literal>, remains the same as that of LZSS. However, the copy case codeword is different. LZAC presents new multiple copy case codewords. When fixed-length coding is used, the codeword is <1, length, offset>. When variable-length coding is used in a non-lookahead case, the codeword is <1, 0, length, offset-difference>. When variable-length coding is used in a lookahead case, the codeword is <1, 0, 0, offset-difference , offset>. LZAC also creates two new coding tables, λ1 and λ2, for variable-length coding. While the λ1 coding table is used to code the length, offset, and offset-difference , the λ2 coding table is used to code the offset-difference. The two new concepts, composite fixed-variable-length coding and offsetdifference coding, enable LZAC to further improve the LZ family’s compression ratios without using the complex and slow adaptive arithmetic coding or Huffman coding. The Calgary corpus was used to evaluate LZAC’s performance. The results show that LZAC achieves a non-weighted average compression ratio of 2.94 bits/char in an 8-kbyte window which surpasses the performance of any other compression algorithm in the LZ77 and LZ78 families.
منابع مشابه
Lossless Microarray Image Compression by Hardware Array Compactor
Microarray technology is a new and powerful tool for concurrent monitoring of large number of genes expressions. Each microarray experiment produces hundreds of images. Each digital image requires a large storage space. Hence, real-time processing of these images and transmission of them necessitates efficient and custom-made lossless compression schemes. In this paper, we offer a new archi...
متن کاملData Compression for Network GIS
Data compression algorithms can be categorized into lossless and lossy. Bit streams generated by lossless compression algorithm can be faithfully recovered to the original data. If loss of one single bit may cause serious and unpredictable consequences in original data (for example, text and medical image compression) lossless compression algorithm should be applied. If data consumers can toler...
متن کاملComparison of Different Methods for Lossless Medical Image Compression
Here, the concept of compression theory and lossless image compression methods are studied. In order to preserve the value of diagnostic medical images, it is necessary to provide lossless image compression. Apart from practical reasons, there are often legal restrictions on the lossless medical image compression. Lossless data compression has been suggested for many space science exploration m...
متن کاملA Novel Real Time Algorithm for Remote Sensing Lossless Data Compression based on Enhanced DPCM
In this paper, simplicity of prediction models for image transformation is used to introduce a low complex and efficient lossless compression method for LiDAR rasterized data and RS grayscale images based on improving the energy compaction ability of prediction models. Further, proposed method is applied on some RS images and LiDAR test cases, and the results are evaluated and compared with oth...
متن کاملLossless and Lossy Data Compression
Data compression (or source coding) is the process of creating binary representations of data which require less storage space than the original data 7; 14; 15]. Lossless compression is used where perfect reproduction is required while lossy compression is used where perfect reproduction is not possible or requires too many bits. Achieving optimal compression with respect to resource constraint...
متن کامل